N-Way Fail-Over Infrastructure for Survivable Servers and Routers

نویسندگان

Yair Amir

Ryan Caudy

Ashima Munjal

Theo Schlossnagle

Ciprian Tutu

چکیده

Maintaining the availability of critical servers and routers is an important concern for many organizations. At the lowest level, IP addresses represent the global namespace by which services are accessible on the Internet. We introduce Wackamole, a completely distributed software solution based on a provably correct algorithm that negotiates the assignment of IP addresses among the currently available servers upon detection of faults. This reallocation ensures that at any given time any public IP address of the server cluster is covered exactly once, as long as at least one physical server survives the network fault. The same technique is extended to support highly available routers. The paper presents the design considerations, algorithm specification and correctness proof, discusses the practical usage for server clusters and for routers, and evaluates the performance of the system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

N-Way Fail-Over Infrastructure for Reliable Servers and Routers

متن کامل

Towards a Survivable Security Architecture for Ad-Hoc Networks

We present a security architecture for access control in ad-hoc networks of mobile electronic devices. Ad-hoc networks are formed on demand without support from pre-existing infrastructure such as central servers, security associations or CAs. Our architecture is fully distributed and based on groups and public-key certification. The goal is a survivable system that functions well even when net...

متن کامل

(m, M) Machining system with two unreliable servers, mixed spares and common-cause failure

This paper deals with multi-component machine repair model having provision of warm standby units and repair facility consisting of two heterogeneous servers (primary and secondary) to provide repair to the failed units. The failure of operating and standby units may occur individually or due to some common cause. The primary server may fail partially following full failure whereas secondary se...

متن کامل

Architecture and Execution Model for a Survivable Workflow Transaction Infrastructure

We present a novel architecture and execution model for an infrastructure supporting fault-tolerant, long-running distributed applications spanning multiple administrative domains. Components for both transaction processing and persistent state are replicated across multiple servers, ensuring that applications continue to function correctly despite arbitrary (Byzantine) failure of a bounded num...

متن کامل

Design of survivable IP-over-optical networks

In the past years, telecommunications networks have seen an important evolution with the advances in optical technologies and the explosive growth of the Internet. Several optical systems allow a very large transport capacity, and data tra c has dramatically increased. Telecommunications networks are now moving towards a model of high-speed routers interconnected by intelligent optical core net...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2002

N-Way Fail-Over Infrastructure for Survivable Servers and Routers

نویسندگان

چکیده

منابع مشابه

N-Way Fail-Over Infrastructure for Reliable Servers and Routers

Towards a Survivable Security Architecture for Ad-Hoc Networks

(m, M) Machining system with two unreliable servers, mixed spares and common-cause failure

Architecture and Execution Model for a Survivable Workflow Transaction Infrastructure

Design of survivable IP-over-optical networks

عنوان ژورنال:

اشتراک گذاری